Evaluating fuzzy clustering for relevance-based information access
نویسندگان
چکیده
This paper analyzes the suitability of fuzzy clustering methods for the discovery of relevant document relationships, motivated by the need for enhanced relevancebased navigation of Web-accessible resources. The performance evaluation of a modified Fuzzy c-Means algorithm is carried out, and a comparison with a traditional hard clustering technique is presented. Clustering precision and recall are defined and applied as quantitative evaluation measures of the clustering results. The experiments with various test document sets have shown that in most cases fuzzy clustering performs better than the hard k-Means algorithm and that the fuzzy membership values can be used to determine document relevance and to control the amount of information retrieved to the user.
منابع مشابه
A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach
In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...
متن کاملFuzzy clustering of time series data: A particle swarm optimization approach
With rapid development in information gathering technologies and access to large amounts of data, we always require methods for data analyzing and extracting useful information from large raw dataset and data mining is an important method for solving this problem. Clustering analysis as the most commonly used function of data mining, has attracted many researchers in computer science. Because o...
متن کاملSystematic literature review of fuzzy logic based text summarization
Information Overloadrq is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
متن کاملImage retrieval by fuzzy clustering of relevance feedback records
We present an image retrieval method based on the accumulated user relevance feedback records. Our method conducts the semisupervised fuzzy clustering on the records, and the subsequent information filtering within the target cluster is performed to guide the refinement of query parameters. During information filtering, both the user’s relevance evaluation and the corresponding query image of t...
متن کاملFuzzy Labeled Neural Gas for Fuzzy Classification
We extend neural gas for supervised fuzzy classification. In this way we are able to learn crisp as well as fuzzy clustering, given labeled data. Based on the neural gas cost function, we propose three different ways to incorporate the additional class information into the learning algorithm. We demonstrate the effect on the location of the prototypes and the classification accuracy. Further, w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003